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GAATTCGGAG gaattattca aaacataaac ACAATAAACA ATTTGAGTAG TTGCCGCACA 60 \ 

"1 

CACACACACA CACAGCCCGT GGATTATTAC ACTAAAAGCG ACACTCAATC CAAAAAATCA 120 



GCAACAAAAA CATCAATAAA C ATG CAT TGG ATT AAA TGT TTA TTA ACA GCA 

Met His Trp lie Lys Cys Leu Leu Thr Ala 
1.5 10 



171 



TTC ATT TGC TTC ACA GTC ATC GTG CAG GTT CAC AGT TCC GGC AGC TTT 
Phe lie Cys Phe Thr Val He Vol Gin Val His Ser Ser Gly Ser Phe 

15 20 25 



219 



GAG TTG CGC CTG AAG TAC TTC AGC AAC GAT CAC GGG CGG GAC AAC GAG 
Glu Leu Arg Leu Lys Tyr Phe Ser Asn Asp His Gly Arg Asp Asn Glu 

30 35 40 



267 



GGT CGC TGC TGC AGC GGG GAG TCG GAC GGA GCG ACG GGC AAG TGC CTG 
Gly Arg Cys Cys Ser Gly Glu Ser Asp Gly Ala Thr Gly Lys Cys Leu 

45 50 55 



315 



GGC AGC TGC AAG ACG CGG TTT CGC GTC TGC CTA AAG CAC TAC CAG GCC 
Gly Ser Cys Lys Thr Arg Phe Arg Val Cys Leu Lys His Tyr Gin Ala 
60 65 70 



363 



ACC ATC GAC ACC ACC TCC CAG TGC ACC TAC GGG GAC GTG ATC ACG CCC 
Thr He Asp Thr Thr Ser Gin Cys Thr Tyr Gly Asp Val He Thr Pro 
75 80 85 90 



411 



ATT CTC GGC GAG AAC TCG GTC AAT CTG ACC GAC GCC CAG CGC TTC CAG 
He Leu Gly Glu Asn Ser Val Asn Leu Thr Asp Ala Gin Arg Phe Gin 

95 100 105 



459 



AAC AAG GGC TTC ACG AAT CCC ATC CAG TTC CCC TTC TCG TTC TCA TGG 507 
Asn Lys Gly Phe Thr Asn Pro He Gin Phe Pro Phe Ser Phe Ser Trp 

110 115 120 
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CCG GGT ACC TTC TCG CTG ATC GTC GAG GCC TGG CAT GAT ACG AAC AAT 
Pro Gly Thr Phe Ser Leu He Vol Glu Ala Trp His Asp Thr Asn Asn 
125 130 135 

AGC GGC AAT GCG CGA ACC AAC AAG CTC CTC ATC CAG CGA CTC TTG GTG 
Ser Gly Asn Ala Arg Thr Asn Lys Leu Leu He Gin Arg Leu Leu Val 
140 145 150 

CAG CAG GTA CTG GAG GTG TCC TCC GAA TGG AAG ACG AAC AAG TCG GAA 
Gin Gin Val Leu Glu Val Ser Ser Glu Trp Lys Thr Asn Lys Ser Glu 
155 160 165 170 

TCG CAG TAC ACG TCG CTG GAG TAC GAT TTC CGT GTC ACC TGC GAT CTC 
Ser Gin Tyr Thr Ser Leu Glu Tyr Asp Phe Arg Val Thr Cys Asp Leu 

175 180 185 

AAC TAC TAC GGA TCC GGC TGT GCC AAG TTC TGC CGG CCC CGC GAC GAT 
Asn Tyr Tyr Gly Ser Gly Cys Ala Lys Phe Cys Arg Pro Arg Asp Asp 

190 195 200 

TCA TTT GGA CAC TCG ACT TGC TCG GAG ACG GGC GAA ATT ATC TGT TTG 
Ser Phe Gly His Ser Thr Cys Ser Glu Thr Gly Glu He He Cys Leu 
205 210 215 

ACC GGA TGG CAG GGC GAT TAC TGT CAC ATA CCC AAA TGC GCC AAA GGC 
Thr Gly Trp Gin Gly Asp Tyr Cys His He Pro Lys Cys Ala Lys Gly 
220 225 230 

TGT GAA CAT GGA CAT TGC GAC AAA CCC AAT CAA TGC GTT TGC CAA CTG 
Cys Glu His Gly His Cys Asp Lys Pro Asn Gin Cys Val Cys Gin Leu 
235 240 245 250 

GGC TGG AAG GGA GCC TTG TGC AAC GAG TGC GTT CTG GAA CCG AAC TGC 
Gly Trp Lys Gly Ala Leu Cys Asn Glu Cys Val Leu Glu Pro Asn Cys 

255 260 265 
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ATC CAT GGC ACC TGC AAC AAA CCC TGG ACT TGC ATC TGC AAC GAG GGT 

lie His Gly Thr Cys Asn Lys Pro Trp Thr Cys He Cys Asn Glu Gly 

270 275 280 

TGG GGA GGC TTG TAC TGC AAC CAG GAT CTG AAC TAC TGC ACC AAC CAC 

Trp Gly Gly Leu Tyr Cys Asn Gin Asp Leu Asn Tyr Cys Thr Asn His 

285 290 295 

AGA CCC TGC AAG AAT GGC GGA ACC TGC TTC AAC ACC GGC GAG GGA TTG 

Arg Pro Cys Lys Asn Gly Gly Thr Cys Phe Asn Thr Gly Glu Gly Leu 

300 305 310 
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TGT GCC AAT GGT TTC AGG GGC AAG CAG TGC GAT GAG GAG TCC TAC GAT 1851 
Cys Ala Asn Gly Phe Arg Gly Lys Gin Cys Asp Glu Glu Ser Tyr Asp 
555 560 565 570 

TCG GTG ACC TTC GAT GCC CAC CAA TAT GGA GCG ACC ACA CAA GCG AGA 1899 
Ser Val Thr Phe Asp Ala His Gin Tyr Gly Ala Thr Thr Gin Ala Arg 

575 580 585 

GCC GAT GGT TTG ACC AAT GCC CAG GTA GTC CTA ATT GCT GTT TTC TCC 1947 
Ala Asp Gly Leu Thr Asn Ala Gin Val Val Leu He Ala Val Phe Ser 

590 595 600 

GTT GCG ATG CCT TTG GTG GCG GTT ATT GCG GCG TGC GTG GTC TTC TGC 1995 
Val Ala Met Pro Leu Val Ala Val He Ala Ala Cys Val Val Phe Cys 
605 610 615 

ATG AAG CGC AAG CGT AAG CGT GCT CAG GAA AAG GAC GAC GCG GAG GCC 2043 
Met Lys Arg Lys Arg Lys Arg Ala Gin Glu Lys Asp Asp Ala Glu Ala 
620 625 630 

AGG AAG CAG AAC GAA CAG AAT GCG GTG GCC ACA ATG CAT CAC AAT GGC 2091 
Arg Lys Gin Asn Glu Gin Asn Ala Val Ala Thr Met His His Asn Gly 
635 640 645 650 

AGT GGG GTG GGT GTA GCT TTG GCT TCA GCC TCT CTG GGC GGC AAA ACT 2139 
Ser Gly Val Gly Val Ala Leu Ala Ser Ala Ser Leu Gly Gly Lys Thr 

655 660 665 

GGC AGC AAC AGC GGT CTC ACC TTC GAT GGC GGC AAC CCG AAT ATC ATC 2187 
Gly Ser Asn Ser Gly Leu Thr Phe Asp Gly Gly Asn Pro Asn lie He 

670 675 680 

AAA AAC ACC TGG GAC AAG TCG GTC AAC AAC ATT TGT GCC TCA GCA GCA 2235 
Lys Asn Thr Trp Asp Lys Ser Val Asn Asn He Cys Ala Ser Ala Ala 
685 690 695 
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GCA GCG GCG GCG GCG GCA GCA GCG GCG GAC GAG TGT CTC ATG TAC GGC 8283 
Ala Ala Ala Ala Ala Ala Ala Ala Ala Asp Glu Cys Leu Met Tyr Gly 
700 705 710 

GGA TAT GTG GCC TCG GTG GCG GAT AAC AAC AAT GCC AAC TCA GAC TTT 2331 
Gly Tyr Val Ala Ser Val Ala Asp Asn Asn Asn Ala Asn Ser Asp Phe 
715 720 725 730 

TGT GTG GCT CCG CTA CAA AGA GCC AAG TCG CAA AAG CAA CTC AAC ACC 2379 
Cys Val Ala Pro Leu Gin Arg Ala Lys Ser Gin Lys Gin Leu Asn Thr 

735 740 745 

GAT CCC ACG CTC ATG CAC CGC GGT TCG CCG GCA GGC AGC TCA GCC AAG 2427 
Asp Pro Thr Leu Met His Arg Gly Ser Pro Ala Gly Ser Ser Ala Lys 

750 755 760 

GGA GCG TCT GGC GGA GGA CCG GGA GCG GCG GAG GGC AAG AGG ATC TCT 2475 
Gly Ala Ser Gly Gly Gly Pro Gly Ala Ala Glu Gly Lys Arg He Ser 
765 770 775 

GTT TTA GGC GAG GGT TCC TAC TGT AGC CAG CGT TGG CCC TCG TTG GCG 2523 
Val Leu Gly Glu Gly Ser Tyr Cys Ser Gin Arg Trp Pro Ser Leu Ala 
780 785 790 

GCG GCG GGA GTG GCC GGA GCC TGT TCA TCC CAG CTA ATG GCT GCA GCT 2571 
Ala Ala Gly Val Ala Gly Ala Cys Ser Ser Gin Leu Met Ala Ala Ala 
795 800 805 810 

TCG GCA GCG GGC AGC GGA GCG GGG ACG GCG CAA CAG CAG CGA TCC GTG 2619 
Ser Ala Ala Gly Ser Gly Ala Gly Thr Ala Gin Gin Gin Arg Ser Val 

815 820 825 

GTC TGC GGC ACT CCG CAT ATG TAACTCCAAA AATCCGGAAG GGCTCCTGGT 
Val Cys Gly Thr Pro His Met 

830 

AAATCCGGAG AAATCCGCAT GGAGGAGCTG ACAGCACATA CACAAAGAAA AGACTGGGTT 
GGGTTCAAAA TGTGAGAGAG ACGCCAAAAT GTTGTTGTTG ATTGAAGCAG TTTAGTCGTC 
ACGAAAAATG AAAAATCTGT AACAGGCATA ACTCGTAAAC TCCCTAAAAA ATTTGTATAG 
TAATTAGCAA AGCTGTGACC CAGCCGTTTC GATCCCGAAT TC 
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1 GAATTCCGCT GGGAGAATGG TCTGAGCTAC CTGCGCGTCC TGCTGGGGCA TCAATGGCAA 

61 GTGGGGAAAG CCACACTGGG CAAACGGGCC AGGCCATTTC TGGAATGTGG TACATGGTGG 

121 GCAGGGGGCC CGCAACAGCT GGAGGGCAGG TGGACTGAGG CTGGGGATCC CCCGCTGGTT 

i§ I 181 GGGCAATACT GCCTTTACGC ATGAGCTGGA AAGTCACAAT GGGGGGCAAG GGCTCCCGAG 

_J 241 GGTGGTTATG TGCTTCCTTC AGGTGGC 



5 a„ 




ft ?! 

£iS_J FIG.8A 



5 At: 



1 GAATTCCTTC CATTATACGT GACTTTTCTG AAACTGTAGC CACCCTAGTG TCTCTAACTC 

61 CCTCTGGAGT TTGTCAGCTT TGGTCTTTTC AAAGAGCAGG CTCTCTTCAA GCTCCTTAAT 

121 GCGGGCATGC TCCAGTTTGG TCTGCGTCTC AAGATCACCT TTGGTAATTG ATTCTTCTTC 

181 AACCCGGAAC TGAAGGCTGG CTCTCACCCT CTAGGCAGAG CAGGAATTCC GAGGTGGATG 

241 TGTTAGATGT GAATGTCCGT GGCCCAGATG GCTGCACCCC ATTGATGTTG GCTTCTCTCC 

301 GAGGAGGCAG CTCAGATTTG AGTGATGAAG ATGAAGATGC AGAGGACTGT TCTGCTAACA 

361 TCATCACAGA CTTGGTCTAC CAGGGTGCCA GCCTCCAGAC CAGACAGACC GGACTGGTGA 

421 GATGGCCCTG CACCTTGCAG CCCGCTACTC ACGGGCTGAT GCTGCCAAGC GTCTCCTGGA 

481 TGCAGGTGCA GATGCCAATG CCCAGGACAA CATGGGCCGC TGTCCACTCC ATGCTGCAGT 

541 GGCACGTGAT GCCAAGGTGT ATTCAGATCT GTTA 



FIG.8B 



1 TCCAGATTCT GATTCGCAAC CGAGTAACTG ATCTAGATGC CAGGATGAAT GATGGTACTA 

61 CACCCCTGAT CCTGGCTGCC CGCCTGGCTG TGGAGGGAAT GGTGGCAGAA CTGATCAACT 

121 GCCAAGCGGA TGTGAATGCA GTGGATGACC ATGGAAAATC TGCTCTTCAC TGGGCAGCTG 

181 CTGTCAATAA TGTGGAGGCA ACTCTTTTGT TGTTGAAAAA TGGGGCCAAC CGAGACATGC 

241 AGGACAACAA GGAAGAGACA CCTCTGTTTC TTGCTGCCCG GGAGGAGCTA TAAGC 



FIG.8C 

6 



1 GAATTCCATT CAGGAGGAAA GGGTGGGGAG AGAAGCAGGC ACCCACTTTC CCGTGGCTGG 

61 ACTCGTTCCC AGGTGGCTCC ACCGGCAGCT GTGACCGCCG CAGGTGGGGG CGGAGTGCCA 

121 TTCAGAAAAT TCCAGAAAAG CCCTACCCCA ACTCGGACGG CAACGTCACA CCCGTGGGTA 

181 GCAACTGGCA CACAAACAGC CAGCGTGTCT GGGGCACGGG GGGATGGCAC CCCCTGCAGG 

241 CAGAGCTG 



FIG.9A 



1 CTAAAGGGAA CAAAAGCNGG AGCTCCACCG CGGGCGGCNC NGCTCTAGAA CTAGTGGANN 

61 NCCCGGGCTG CAGGAATTCC GGCGGACTGG GCTCGGGCTC AGAGCGGCGC TGTGGAAGAG 

121 ATTCTAGACC GGGAGAACAA GCGAATGGCT GACAGCTGGC CTCCAAAGTC ACCAGGCTCA 

181 AATCGCTCGC CCTGGACATC GAGGGATGCA GAGGATCAGA ACCGGTACCT GGATGGCATG 

241 ACTCGGATTT ACAAGCATGA CCAGCCTGCT TACAGGGAGC GTGANNTTTT CACATGCAGT 

301 CGACAGACAC GAGCTCTATG CAT 



FIG.9B 
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G GAG GTG GAT GTG TTA GAT GTG AAT GTC CGT GGC CCA GAT GGC TGC 
Glu Val Asp Val Leu Asp Val Asn Val Arg Gly Pro Asp Gly Cys 
15 10 15 

ACC CCA TTG ATG TTG GCT TCT CTC CGA GGA GGC AGC TCA GAT TTG AGT 
Thr Pro Leu Met Leu Ala Ser Leu Arg Gly Gly Ser Ser Asp Leu Ser 

20 25 50 

GAT GAA GAT GAA GAT GCA GAG GAC TCT TCT GCT AAC ATC ATC ACA GAC 
Asp Glu Asp Glu Asp Ala Glu Asp Ser Ser Ala Asn He He Thr Asp 

35 40 45 

TTG GTC TAC CAG GGT GCC AGC CTC CAG GCC CAG ACA GAC CGG ACT GGT 
Leu Val Tyr Gin Gly Ala Ser Leu Gin Ala Gin Thr Asp Arg Thr Gly 

50 55 60 



46 



94 



142 



190 



GAG ATG GCC CTG CAC CTT GCA GCC CGC TAC TCA CGG GCT GAT GCT GCC 
Glu Met Ala Leu His Leu A la Ala Arg Tyr Ser Arg Ala Asp Ala Ala 
65 70 75 



238 



AAG CGT CTC CTG GAT GCA GGT GCA GAT GCC AAT GCC CAG GAC AAC ATG 
Lys Arg Leu Leu Asp Ala Gly Ala Asp Ala Asn Ala Gin Asp Asn Met 
80 85 90 95 

GGC CGC TGT CCA CTC CAT GCT GCA GTG GCA GCT GAT GCC CAA GGT GTC 
Gly Arg Cys Pro Leu His Ala Ala Val Ala Ala Asp Ala Gin Gly Val 

100 105 110 

TTC CAG ATT CTG ATT CGC AAC CGA GTA ACT GAT CTA GAT GCC AGG ATG 
Phe Gin He Leu He Arg Asn Arg Val Thr Asp Leu Asp Ala Arg Met 

115 120 125 

AAT GAT GGT ACT ACA CCC CTG ATC CTG GCT GCC CGC CTG GCT GTG GAG 
Asn Asp Gly Thr Thr Pro Leu He Leu Ala Ala Arg Leu Ala Val Glu 
130 135 140 



286 



334 



382 



430 
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GGA ATG GTG GCA GAA CTG ATC AAC TGC CAA GCG GAT GTG AAT GCA GTG 
Gly Met Vol Ala Glu Leu He Asn Cys Gin Ala Asp Val Asn Ala Val 
145 150 155 

GAT GAC CAT GGA AAA TCT GCT CTT CAC TGG GCA GCT GCT GTC AAT AAT 
Asp Asp His Gly Lys Ser Ala Leu His Trp Ala Ala Ala Val Asn Asn 
160 165 170 175 

GTG GAG GCA ACT CTT TTG TTG TTG AAA AAT GGG GCC AAC CGA GAC ATG 
Val Glu Ala Thr Leu Leu Leu Leu Lys Asn Gly Ala Asn Arg Asp Met 

180 185 190 

CAG GAC AAC AAG GAA GAG ACA CCT CTG TTT CTT GCT GCC CGG GAG GGG 
Gin Asp Asn Lys Glu Glu Thr Pro Leu Phe Leu Ala Ala Arg Glu Gly 

195 200 205 

AGC TAT GAA GCA GCC AAG ATC CTG TTA GAC CAT TTT GCC AAT CGA GAC 
Ser Tyr Glu Ala Ala Lys He Leu Leu Asp His Phe Ala Asn Arg Asp 
210 215 220 

ATC ACA GAC CAT ATG GAT CGT CTT CCC CGG GAT GTG GCT CGG GAT CGC 
He Thr Asp His Met Asp Arg Leu Pro Arg Asp Val Ala Arg Asp Arg 

225 230 235 

ATG CAC CAT GAC ATT GTG CGC CTT CTG GAT GAA TAC AAT GTG ACC CCA 
Met His His Asp He Val Arg Leu Leu Asp Glu Tyr Asn Val Thr Pro 
240 245 250 255 

AGC CCT CCA GGC ACC GTG TTG ACT TCT GCT CTC TCA CCT GTC ATC TGT 
Ser Pro Pro Gly Thr Val Leu Thr Ser Ala Leu Ser Pro Val He Cys 

260 265 270 

GGG CCC AAC AGA TCT TTC CTC AGC CTG AAG CAC ACC CCA ATG GGC AAG 
Gly Pro Asn Arg Ser Phe Leu Ser Leu Lyn His Thr Pro Met Gly Lys 

275 280 285 
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526 



574 



622 



670 



718 



766 



814 



862 
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AAG TCT AGA CGG CCC AGT GCC AAG AGT ACC ATG CCT ACT AGC CTC CCT 910 
Lys Ser Arg Arg Pro Ser Ala Lys Ser Thr Met Pro Thr Ser Leu Pro 
290 295 300 

AAC CTT GCC AAG GAG GCA AAG GAT GCC AAG GGT AGT AGG AGG AAG AAG 958 
Asn Leu Ala Lys Glu Ala Lys Asp Ala Lys Gly Ser Arg Arg Lys Lys 
305 310 315 

TCT CTG AGT GAG AAG GTC CAA CTG TCT GAG AGT TCA GTA ACT TTA TCC 1006 
Ser Leu Ser Glu Lys Val Gin Leu Ser Glu Ser Ser Val Thr Leu Ser 
320 325 330 335 

CCT GTT GAT TCC CTA GAA TCT CCT CAC ACG TAT GTT TCC GAC ACC ACA 1054 
Pro Val Asp Ser Leu Glu Ser Pro His Thr Tyr Val Ser Asp Thr Thr 

340 345 350 

TCC TCT CCA ATG ATT ACA TCC CCT GGG ATC TTA CAG GCC TCA CCC AAC 1102 
Ser Ser Pro Met He Thr Ser Pro Gly He Leu Gin Ala Ser Pro Asn 

355 360 365 

CCT ATG TTG GCC ACT GCC GCC CCT CCT GCC CCA GTC CAT GCC CAG CAT 1150 
Pro Met Leu Ala Thr Ala Ala Pro Pro Ala Pro Val His Ala Gin His 
370 375 380 

GCA CTA TCT TTT TCT AAC CTT CAT GAA ATG CAG CCT TTG GCA CAT GGG 1198 
Ala Leu Ser Phe Ser Asn Leu His Glu Met Gin Pro Leu Ala His Gly 
385 390 395 

GCC AGC ACT GTG CTT CCC TCA GTG AGC CAG TTG CTA TCC CAC CAC CAC 1246 
Ala Ser Thr Val Leu Pro Ser Val Ser Gin Leu Leu Ser His His His 
400 405 410 415 

ATT GTG TCT CCA GGC AGT GGC AGT GCT GGA AGC TTG AGT AGG CTC CAT 1294 
He Val Ser Pro Gly Ser Gly Ser Ala Gly Ser Leu Ser Arg Leu His 

420 425 430 

CCA GTC CCA GTC CCA GCA GAT TGG ATG AAC CGC ATG GAG GTG AAT GAG 1342 
Pro Val Pro Val Pro Ala Asp Trp Met Asn Arg Met Glu Val Asn Glu 

435 440 445 



FIG.11C 



ACC CAG TAC AAT GAG ATG TTT GGT ATG GTC CTG GCT CCA GCT GAG GGC 
Thr Gin Tyr Asn Glu Met Phe Gly Met Val Leu Ala Pro Ala Glu Gly 
450 455 460 



1390 




ACC CAT CCT GGC ATA GCT CCC CAG AGC AGG CCA CCT GAA GGG AAG CAC 1438 
Thr His Pro Gly He Ala Pro Gin Ser Arg Pro Pro Glu Gly Lys His 
465 470 475 

ATA ACC ACC CCT CGG GAG CCC TTG CCC CCC ATT GTG ACT TTC CAG CTC 1486 
He Thr Thr Pro Arg Glu Pro Leu Pro Pro He Val Thr Phe Gin Leu 
480 485 490 495 

ATC CCT AAA GGC AGT ATT GCC CAA CCA GCG GGG GCT CCC CAG CCT CAG 1534 
He Pro Lys Gly Ser He Ala Gin Pro Ala Gly Ala Pro Gin Pro Gin 

500 505 510 

TCC ACC TGC CCT CCA GCT GTT GCG GGC CCC CTG CCC ACC ATG TAC CAG 1582 
Ser Thr Cys Pro Pro Ala Val Ala Gly Pro Leu Pro Thr Met Tyr Gin 

515 520 525 



ATT CCA GAA ATG GCC CGT TTG CCC AGT GTG GCT TTC CCC ACT GCC ATG 
lie Pro Glu Met Ala Arg Leu Pro Ser Val Ala Phe Pro Thr Ala Met 
530 535 540 



1630 



ATG CCC CAG CAG GAC GGG CAG GTA GCT CAG ACC ATT CTC CCA GCC TAT 1678 

Met Pro Gin Gin Asp Gly Gin Val Ala Gin Thr He Leu Pro Ala Tyr 
545 550 555 

CAT CCT TTC CCA GCC TCT GTG GGC AAG TAC CCC ACA CCC CCT TCA CAG 1726 

His Pro Phe Pro Ala Ser Val Gly Lys Tyr Pro Thr Pro Pro Ser Gin 
560 565 570 575 



CAC AGT TAT GCT TCC TCA AAT GCT GCT GAG CGA ACA CCC AGT CAC AGT 
His Ser Tyr Ala Ser Ser Asn Ala Ala Glu Arg Thr Pro Ser His Ser 

580 585 590 



1774 



GGT CAC CTC CAG GGT GAG CAT CCC TAC CTG ACA CCA TCC CCA GAG TCT 1822 
Gly His Leu Gin Gly Glu His Pro Tyr Leu Thr Pro Ser Pro Glu Ser 

595 600 605 



FIG.11D 




CCT GAC CAG TGG TCA AGT TCA TCA CCC CAC TCT GCT TCT GAC TGG TCA 

Pro Asp Gin Trp Ser Ser Ser Ser Pro His Ser Ala Ser Asp Trp Ser 

610 615 620 

GAT GTG ACC ACC AGC CCT ACC CCT GGG GGT GCT GGA GGA GGT CAG CGG 

Asp Val Thr Thr Ser Pro Thr Pro Gly Gly Ala Gly Gly Gly Gin Arg 
625 630 635 

GGA CCT GGG ACA CAC ATG TCT GAG CCA CCA CAC AAC AAC ATG CAG GTT 

Gly Pro Gly Thr His Met Ser Glu Pro Pro His Asn Asn Met Gin Vol 
640 645 650 655 

TAT GCG TGAGAGAGTC CACCTCCAGT GTAGAGACAT AACTGACTTT TGTAAATGCT 
Tyr Ala 



1870 



1918 



1966 



2022 



GCTGAGGAAC 


AAATGAAGGT 


CATCCGGGAG 


AGAAATGAAG 


AAATCTCTGG 


AGCCAGCTTC 


2082 


TAGAGGTAGG 


AAAGAGAAGA 


TGTTCTTATT 


CAGATAATGC 


AAGAGAAGCA 


ATTCGTCAGT 


2142 


TTCACTGGGT 


ATCTGCAAGG 


CTTATTGATT 


ATTCTAATCT 


AATAAGACAA 


GTTTGTGGAA 


2202 


ATGCAAGATG 


AATACAAGCC 


TTGGGTCCAT 


GTTTACTCTC 


TTCTATTTGG 


AGAATAAGAT 


2262 


GGATGCTTAT 


TGAAGCCCAG 


ACATTCTTGC 


AGCTTGGACT 


GCATTTTAAG 


CCCTGCAGGC 


2322 


TTCTGCCATA 


TCCATGAGAA 


GATTCTACAC 


TAGCGTCCTG 


TTGGGAATTA 


TGCCCTGGAA 


2382 


TTCTGCCTGA 


ATTGACCTAC 


GCATCTCCTC 


CTCCTTGGAC 


ATTCTTTTGT 


CTTCATTTGG 


2442 


TGCTTTTGGT 


TTTGCACCTC 


TCCGTGATTG 


TAGCCCTACC 


AGCATGTTAT 


AGGGCAAGAC 


2502 


CTTTGTGCTT 


TTGATCATTC 


TGGCCCATGA 


AAGCAACTTT 


GGTCTCCTTT 


CCCCTCCTGT 


2562 


CTTCCCGGTA 


TCCCTTGGAG 


TCTCACAAGG 


TTTACTTTGG 


TATGGTTCTC 


AGCACAAACC 


2622 


TTTCAAGTAT 


GTTGTTTCTT 


TGGAAAATGG 


ACATACTGTA 


TTGTGTTCTC 


CTGCATATAT 


2682 


CATTCCTGGA 


GAGAGAAGGG 


GAGAAGAATA 


CTTTTCTTCA 


ACAAATTTTG 


GGGGCAGGAG 


2742 


ATCCCTTCAA 


GAGGCTGCAC 


CTTAATTTTT 


CTTGTCTGTG 


TGCAGGTCTT 


CATATAAACT 


2802 



FIG.11E 




TTACCAGGAA GAAGGGTGTG AGTTTGTTGT 
GTTTTATCCT TGATAGTCTA GTTACTATGA 
GTTTGGAATG TTGGAATGAC CAAGAGACAA 
CACAGGTCCC CCTACTTCCT GCCAAGCATT 
CCCAGATCTG AGCATTCTAG GCCTGTTTCA 



AACTGTTGAG CCTTTCCTTT CATATCCACA 
CCATTTAGGA CTGAACTTTC CTTAGCCCAA 

CAGATGATCA GTCTCTACTG ATTATCTTGC 
TCACACCGTG TGGTCCGTGT TACTGGTATA 
TTATATGTTC AAGTGCAGGA ATTGGAAAGT 
CTATAAGAAG GTTGGAAAAG GAGGAACTAT 
TTCTTTTCCT CTGAAGCGGC CATGACATTC 



TTTTCTGTGT ATGGGCCTGG 


TCAGTGTAAA 


2862 


CCCTCCCCAC TTTTTTAAAA 


r\ a f m a a a A r~ 

CCAGAAAAAG 
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GTTAACTCGT GCAAGAGCCA 


/~tt a r^f^r* a r*r* 

GTTACCCACC 
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CCATTGACTG CCTGTATGGA 
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ACACATTTGT 
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CTCACTCACC CAGCATATGA 
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AATCTTTCTT 
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CCCAGTATGT TCTCACTGAA 


GACATGGACT 
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TGGACTTGTT TTCTATGATC 


CAAAACAGCC 


3402 


ATAGCAGCCT TTGCTATTTT 


CTGCTACCAT 


3462 


CCTTTGGCAA CTAACGTAGA 


AACTCAACAG 
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AACATTTTCC 


TTTCCTAGAG 


TCACCTTTTA GATGATAATG GACAACTATA GACTTGCTCA 


3582 


TTGTTCAGAC 


TGATTGCCCC 


TCACCTGAAT CCACTCTCTG TATTCATGCT CTTGGCAATT 


3642 


TCTTTGACTT 


TCTTTTAAGG 


GCAGAAGCAT TTTAGTTAAT TGTAGATAAA GAATAGTTTT 


3702 


CTTCCTCTTC 


TCCTTGGGCC 


AGTTAATAAT TGGTCCATGG CTACACTGCA ACTTCCGTCC 
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AGTGCTGTGA 


TGCCCATGAC 


ACCTGCAAAA TAAGTTCTGC CTGGGCATTT TGTAGATATT 


3822 


AACAGGTGAA 


TTCCCGACTC 


TTTTGGTTTG AATGACAGTT CTCATTCCTT CTATGGCTGC 


3882 


AAGTATGCAT 


CAGTGCTTCC 


CACTTACCTG ATTTGTCTGT CGGTGGCCCC ATATGGAAAC 


3942 


CCTGCGTGTC 


TGTTGGCATA 


ATAGTTTACA AATGGTTTTT TCAGTCCTAT CCAAATTTAT 
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4002 


TGAACCAACA 


AAAATAATTA 


CTTCTGCCCT GAGATAAGCA GATTAAGTTT GTTCATTCTC 


4062 


TGCTTTATTC 


TCTCCATGTG 


GCAACATTCT GTCAGCCTCT TTCATAGTGT GCAAACATTT 


4122 


TATCATTCTA 


AATGGTGACT 


CTCTGCCCTT GGACCCATTT ATTATTCACA GATGGGGAGA 


4182 


ACCTATCTGC 


ATGGACCCTC 


ACCATCCTCT GTGCAGCACA CACAGTGCAG GGAGCCAGTG 


4242 


GCGATGGCGA 


TGACTTTCTT 


CCCCTG 
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Potentiol signol cleavage site 



MP- 
MP- 
MO 



-ALRPAL LWALLALWLC CA- 

PL LAPLLCLALL PA- 

RIGLAVLLCS LP- 



-APA HA- 
LM RG- 
•VLT QG- 



r 



MQSQRSRRRS RAPNTWICFW I NKMHAVASL PASLPLLLLT LAFANLPNIV RGTDTALVAA 



MLGKATCRCA SGFTGEDCQY STSHPCFVSR PCLNGGTCHM LSROT-YECT CQVGFTGKEC 
GVADYACSCA LGFSGPLCLT PLDNAC-LTN PCRNGGTCDL LT-LTEYKCR CPPGWSGKSC 
NAIDFICHCP VGFTDKVCLT PVDNAC-VNN PCRNGGTCEL LNSVTEYKCR CPPGWTGDSC 
GRPGISCKCP LGFOESLCE I AVPNAC-DHV TCLNGGTCQL KT-LEEYTCA CANGYTGERC 



NLPGSYQCQC PQGFTGQYCD SLYVPCAPSP CVNGGTCRQT GOFTFECNCL PGFEGSTCER 
NEVGSYRCVC RATHTGPNCE RPYVPCSPSP CQNGGTCRPT GDVTHECACL PGFTGQNCEE 
NEFGSYRCTC QNRFTGRNCD EPYVPCNPSP CLNGGTCRQT ODTSYDCTCL PGFSGQNCEE 
NTHGSYQCMC PTGYTGKDCO TKYNPCSPSP CQNAGICRSN G-LSYECKCP KGFEGKNCEQ 
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r 



< 



EGF-I ike Repeats 



QCRDGYEPCV NEGMCVTYHN GTGYCKCPEG FLGEYCQHRD PCE-KNRCQN GGTC— VAQA 83 

RCSQPGETCL NGGKCEA-AN GTEACVCGGA FVGPRCQDPN PCL-STPCKN AGTCHWDRR 80 

RCTQTAEMCL NGGRCEMTPG GTGVCLCGNL YFGERCQFPN PCT I KNQCMN FGTCEPVLQG 90 

SCTSVG— CQ NGGTCVTQLN GKTYCACDSH YVGDYCEHRN PCN-SMRCQN GGTCQVTFRN | 117 

QWTDACLSHP CANGSTCTTV — ANQFSCKC LTGFTGQKCE TDVNEC-DIP GHCQHGGTCL I 199 

QQADPCASNP CANGGQCLPF — EASYICHC PPSFHGPTCR QDVNECGQKP RLCRHGGTCH 196 

QQADPCASNP CANGGKCLPF — EIQYICKC PPGFHGATCK QDINEC-S-Q NPCKNGGQC I 195 

ETKNLCASSP CRNGATCTAL AGSSSFTCSC PPGFTGDTCS YD1EEC-Q-S NPCKYGG1CV | 233 

N 1 DDCPNHRC QNGGVCVDGV NTYNCRCPPQ WTGQFCTEDV DECLLQPNA- CQNGGTCANR I 318 

N 1 DDCPGNNC KNGGACVDGV NTYNCPCPPE WTGQYCTEDV OECQLMPNA- CQNGGTCHNT 315 

NIDDCPSNNC RNGGTCVDGV NTYNCQCPPD WTGQYCTEDV OECQLMPNA- CQNGGTCHNT 314 

NYDDCLGHLC QNGGTC I DG I SDYTCRCPPN FTGRFCQDDV OECAQRDHPV CQNGATCTNT 352 
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NGGYGCVCVN GWSGDDCSEN IDDCAFASCT PGSTCIDRVA SFSCMCPEGK AGLLCHLDDA 

HGGYNCVCVN GWTGEDCSEN IDDCASAACF HGATCHDRVA SFYCECPHGR TGLLCHLNDA 

YGGYNCVCVN GWTGEDCSEN IDDCANAACH SGATCHDRVA SFYCECPHGR TGLLCHLDNA 

HGSYSCICVN GWAGLDCSNN TDDCKQAACF YGATC I DGVG SFYCQCTKGK TGLLCHLDDA 



AFHCECLKGY AGPRCEMDIN ECHSDPCQND ATCLDKIGGF TCLCMPGFKG VHCELE INEC 
SFECQCLQGY TGPRCEIDVN ECVSNPCQND ATCLDQIGEF QCMCMPGYEG VHCEVNTDEC 
SFQCNCPQGY AGPRCEIDVN ECLSNPCQND STCLDQIGEF QCICMPGYEG LYCETNIDEC 
SYRCNCSQGF TGPRCETNIN ECESHPCQNE GSCLDDPGTF RCVCMPGFTG TQCE 1 D I DEC 



ATGFTGVLCE ENIDNCDPDP CHHGQCQDGI DSYTCICNPG YMGAICSDQI DECYSSPCLN 

TEGYTGTHCE VDIDECDPDP CHYGSCKDGV ATFTCLCRPG YTGHHCETNI NECSSQPCRL 

TEGFTGRHCE QDINECIPDP CHYGTCKDGI ATFTCLCRPG YTGRLCDND I NECLSKPCLN 

PPGYTGTSCE ININDCDSNP CHRGKC 1 DDV NSFKCLCDPG YTGYICQKQI NECESNPCQF 
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L. 
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CISNPCHKGA LCDTNPLNGQ YICTCPQGYK GADCTEDVDE CAMANSNPCE HAGKCVNTDG 438 

CISNPCNEGS NCDTNPVNGK AICTCPSGYT GPACSQDVDE CSLG-ANPCE HAGKCINTLG 434 

CISNPCNEGS NCDTNPVNGK AICTCPPGYT GPACNNDVDE CSLG-ANPCE HGGRCTNTLG 433 

CTSNPCHADA ICDTSPINGS YACSCATGYK GVDCSED1DE CDQG-SPCE HNG1CVNTPG | 470 

QSNPCVNNGQ CVDKVNRFQC LCPPGFTGPV CQIDIDDCSS TPCLNGAKC I DHPNGYECQC I 558 

ASSPCLHNGR CLDK I NEFQC ECPTGFTGHL CQYDVDECAS TPCKNGAKCL DGPNTYTCVC 554 

ASNPCLHNGK CIDKINEFRC DCPTGFSGNL CQHDFDECTS TPCKNGAKCL DGPNSYTCQC 553 

QSNPCLNDGT CHDK 1 NGFKC SCALGFTGAR CQIN1DDCQS QPCRNRG1CH DS1AGYSCEC [ 590 

DGRC I DLVNG YQCNCQPGTS GVNCE I NFDD CASNPCIHG- ICMDGINRYS CVCSPGFTGQ I 677 

RGTCQDPDNA YLCFCLKGTT GPNCE I NLDD CASSPCDSG- TCLDK I DGYE CACEPGYTGS 673 

GGXTDRENG YICTCPKGTT GVNCE TK1DD CASNLCDNG- KCIDKIDGYE CTCEPGYTGK 672 

DGHCQDRVGS YYCQCQAGTS GKNCEVNVNE CHSNPCNNGA TCIDGINSYK CQCVPGFTGQ 710 
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RCNIOIDECA SNPCRKGATC INGVNGFRCI CPEGPHHPSC YSQVNECLSN PCI-HGNCTG 
MCNSNIDECA GNPCHNGGTC EDGINGFTCR CPEGYHDPTC LSEVNECNSN PCV-HGACRD 
LCNININECD SNPCRNGGTC KDQINGFTCV CPDGYHDHMC LSEVNECNSN PC I -HGACHD 
HCEKNVDECI SSPCANNGVC IDQVNGYKCE CPRGFYDAHC LSOVDECASN PCVNEGRCED 



DECASNPCLN QGTCFDDISG YTCHCVLPYT GKNCQTVLAP CSPNPCENAA VCKESPNFES 

NECASNPCLN KGTCIDDVAG YKCNCLLPYT GATCEWLAP CAPSPCRNGG ECRQSEDYES 

NECSSNPCLN HGTCIODVAG YKCNCMLPYT GA1CEAVLAP CAGSPCKNGG RCKESEOFET 

DDCVTNPCGN GGTCIOKVNG YKCVCKVPFT GRDCESKMDP CASNRCKNEA KCTPSSNFLO 



> 



CLANPCQNGG SCMDGVNTFS CLCLPGFTGD KCQTONMECL SEPCKNGGTC SDYVNSYTCK 
CRPNPCHNGG SCTDG I NTAF COCLPGFRGT FCEEOINECA SDPCRNGANC TDCVDSYTCT 
CQPNPCHNGG SCSDGINMFF CNCPAGFRGP KCEEDINECA SNPCKNGANC TDCVNSYTCT 
CASFPCQNGG TCLDG IGDYS CLCVDGFDGK HCETDINECL SQPCQNGATC SQYVNSYTCT 



_J 



GLSGYKCLCD AGWVGINCEV DKNECLSNPC QNGGTCONLV NGYRCTCKKG FKGYNCQVNI 
SLNGYKCDCD PGWSGTNCDI NNNECESNPC VNGGTCKDMT SGIVCTCREG FSGPNCQTNI 
GVNGYKCDCE AGWSGSNCDI NNNECESNPC MNGGTCKDMT GAYICTCKAG FSGPNCQTNI 
GINEFICHCP PGYTGKRCEL DIDECSSNPC QHGGTCYDKL NAFSCQCMPG YTGQKCETNI 



YTCLCA-PGW QGQRCT1DID EC-ISKPCMN HGLCHNTQGS YMCECPPGFS GMDCEED1DD 
FSCVCPTAGA KGQTCEVDIN EC-VLSPCRH GASCQNTHGG YRCHCQAGYS GRNCETDIDD 
FSCECP-PGW QGQTCEIDMN EC-VNRPCRN GATCQNTNGS YKCNCKPGYT GRNCEMD1DD 
FSCTCK-LGY TGRYCDED I D ECSLSSPCRN GASCLNVPGS YRCLCTKGYE GRDCAINTDD 



CQAGFDGVHC ENNINECTES SCFNGGTCVO GINSFSCLCP VGFTGSFCLH EINECSSHPC 
CPAGFSGIHC ENNTPDCTES SCFNGGTCVD GINSFTCLCP PGFTGSYCQH WNECDSRPC 
CQPGFSGIHC ESNTPDCTES SCFNGGTC1D GINTFTCQCP PGFTGSYCQH DINECDSKPC 
CPLGFSGINC QTNDEDCTES SCLNGGSCID G I NGYNCSCL AGYSGANCQY KLNKCDSNPC 
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LNEGTCVDGL GTYRCSCPLG YTGKNCQTLV NLCSRSPCKN KGTCVQKKAE SQCLCPSGWA 
LLGGTCQDGR GLHRCTCPQG YTGPNCQNLV HWCDSSPCKN GGKCWQTHTQ YRCECPSGWT 
LNGGTCQDSY GTYKCTCPQG YTGLNCQNLV RWCDSSPCKN GGKCWQTNNF YRCECKSGWT 
LNGATCHEQN NEYTCHCPSG FTGKQCSEYV DWCGQSPCEN GATCSQMKHQ FSCKCSAGWT 



SNPCQHGATC SDF IGGYRCE CVPGYQGVNC EYEVDECQNQ PCQNGGTCID LVNHFKCSCP 
PSPCQNGATC TDYLGGYSCK CVAGYHGVNC SEE I DECLSH PCQNGGTCLD LPNTYKCSCP 
PNPCQNGATC TDYLGGYSCE CVAGYHGVNC SEE I NECLSH PCQNGGTCID LINTYKCSCP 
SQPCQNGGTC RDLIGAYECQ CRQGFQGQNC ELNIDDCAPN PCQNGGTCHD RVMNFSCSCP 



CLSNPCSSEG SLDC1QLTND YLCVCRSAFT GRHCETFVDV CPQMPCLNGG TCAVASNMPD 
CLSNPCDARG TQNCVQRVND FHCECRAGHT GRRCESVING CKGKPCKNGG TCAVASNTAR 
CLSNPCDSRG TQNCIQLVND YRCECRQGFT GRRCESWDG CKGMPCRNGG TCAVASNTER 
CLSNPCSNAG TLDCVQLVNN YHCNCRPGHM GRHCEHKVDF CAQSPCQNGG NCNI — RQS 
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C GAYCDVPNVS CDIAASRRGV LVEHLCQHSG VC I NAGNTHY CQCPLGYTGS YCEEQLDECA 
GLYCDVPSVS CEVAAQRQGV DVARLCQHGG LCVDAGNTHH CRCQAGYTGS YCEDLVDECS 
GVYCDVPSVS CEVAAKQQGV DIVHLCRNSG MCVDTGNTHF CRCQAGYTGS YCEEQVDECS 
GKLCDVQT I S CQDAADRKGL SLRQLC-NNG TCKDYGNSHV CYCSQGYAGS YCQKE I DECQ 



PGTRGLLCEE NIDDCAR- 



<5PHCLN GGQCMDRIGG YSCRCLPGFA GERCEGD1NE 



RGTQGVHCE I NVDDCNPPVD PVSRSPKCFN NGTCVDQVGG YSCTCPPGFV GERCEGDVNE 
RGTQGVHCE I NVDDCTPFYD SFTLEPKCFN NGKCIDRVGG YNCICPPGFV GERCEGDVNE 
PGTMGIICEI NKDDCKP GACHN NGSCIDRVGG FECVCQPGFV GARCEGDINE 



GFICRCPPGF SGARCQS — SCGQVKCRKG EQCVHTAS— GPRCFCPSP RDCES- 



GFICKCPAGF EGATCENDAR TCGSLRCLNG GTCISGPR— SPTCLCLGPF TGPECQFPAS 
GFICKCPPGF DGATCEYDSR TCSNLRCQNG GTCISVLT— SSKCVCSEGY TGATCQYPVI 
v GHHC1CNNGF YGKNCELSGQ DCDSNPCRVG -NCWADEGF GYRCECPRGT LGEHCEIDTL 
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-CC-ASSPCQ HGGSCHPQRQ PPYYSCQCAP PFSGSRCEL- -YTAPP S TPP 

SPCLGGNPCY NQGTCEPTSE SPFYRCLCPA KFNGLLCHIL DYSFGG — 

SPC-ASHPCY NGGTCQFFAE EPFFQCFCPK NFNGLFCHIL DYEFPG — 

DEC-SPNPCA QGAACEDLLG D— YECLCPS KWKGKRCD I Y DANYPGWNGG SGSGNORYAA 



-GAGRDIPPP 
-GLGKN I TPP 



NN-OCDELCN TVECLFDNFE CQGNSKTCK- -YDKYCADHF KDNHCNQGCN SEECGWDGLD 
SDGHCDSQCN SAGCLFDGFO CQRAEGQCNP LYDQYCKDHF SDGHCDQGCN SAECEWDGLD 
NDGKCDSQCN NTGCLYDGFD CQKVEVQCNP LYDQYCKDHF QDGHCDQGCN NAECEWDGLD 
KNGKCNEECN NAACHYDGHD CERKLKSCDS LFDAYCQKHY GDGFCDYGCN NAECSWDGLD 



-MTRRSL PGEQ E QEVAGSKVFL 



YYGEKSAAMK KQ— R 

YYGREEELRK HPIKRAAEGW AAPDALLGQV KASLLPGGSE GGRRRRELDP MDVRGS I VYL 

YYGNEEELKK HHIKRSTDYW SDAPSAI FSTMKESIL LGRHRRELDE MEVRGS I VYL 

WKDNVRVPEl EDTDFARKNK I LYTQQVHQ- — TGIQIYL 
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LNR (Notch/Lin-12 Repeots) 
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TCL SQYCADKARD GVCDEACNSH ACQWDGGDCS LTMENPWANC SSPLPCWDYI 
E LPECQEDAGN KVCSLQCNNH ACGWDGGDCS LNFNDPWKNC TQSLQCWKYF 
ICE NEQCSELADN KVCNANCNNH ACGWDGGDCS LNFNDPWKNC TQSLQCWKYF 
DLEQQRAMCD KRGCTEKQGN GICDSDCNTY ACNFDGNDCS LGI-NPWANC TAN-EXWNKF 



CAADQPEN-L AEGTLVIWL MPPEQLLQDA R-SFLRALGT LLHTNLRIKR DSQGELMVYP 
CAEHVPER-L AAGTL-WW LMPPEQLRNS SFHFLRELSR VLHTNWFKR DAHGQQMIFP 
C-ANMPEN-L AEGTLVLWL MPPERLKNNS V-NFLRELSR VLHTNWFKK DSKGEYK I YP 
ce}jktqspvl AEGAMSWML MNVEAFREIQ A-QFLRNMSH MLRTTVRLKK DALGHDIIIN 

TM 



E I DNRQCVQD SDHCFKNTDA AAALLASHAI QG — TLSYP LVSWSESLT PERT-Q-.LY 
E I DNRQCVQA SSQCFQSATD VAAFLGALAS LGSL-NIPYK IEAVQSETVE PPPPAQ-LHF 
E I DNRQCYKS SSQCFNSATD VAAFLGALAS LGSLDTLSYK I EAVKSENME TPKPST-LYP 
E I DNRKCTEC FTHAVEAAEF LAATAAKHQL RNDFQ-IHSV RGIKNPGDED NGEPPANj\/KY 
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LLAVAWIIL FIILLGVIMA 

MYVAAAAFVL LFFVGCGVLL 

MLSMLVIPLL I IFVFMMVIV 

VITG1ILVII ALAFFGMVL- 



KRKRK— HGS LWLPEGFTLR RDASNHKRRE PVGQDAVGLK 
SRKRRRQHGQ LWFPEGFKV- SEASKKKRRE ELGEDSVGLK 
NKKRRREHDS FGSPTALFQK NPA-KRNGET PW-EDSVGLK 
STQRKRAHGV TWFPEGFRAP AAVMSRRRRD PHGQEMRNLN 

CDC-10/Ankyrin Repeats 



PIORRPWTQQ HLEAAD I RRT PSLALTPPQA EQEVDVLDVN VRGPDGCTPL MLASLRGGSS 
QTDHRQWTQQ HLDAADL-RM SAMAPTPPQG EVDADCMDVN VRGPDGFTPL MIASCSGGGL 
KTDPRQWTRQ HLDAADL-RI SSMAPTPPQG E I EADCMDVN VRGPDGFTPL MIASCSGGGL 
EADQRVWSQA HLDWDV-R- AIM—TPP-A HQDGGKHDVD ARGPCGLTPL M1AAVRGGGL 



ANAQDNMGRC PLHAAVAADA QGVFQILIRN RVTDLDARMN DGTTPLILAA RLAVEGMVAE 
AN I QDNMGRT PLHAAVSADA QGVFQILIRN RATDLDARMH DGTTPLILAA RLAVEGMLED 
ANVQDNMGRT PLHAAVAADA QGVFQILIRN RATDLDARMF DGTTPLILAA RLAVEGMVEE 
ANCQDNTGRT PLHAAVAADA MGVFQILLRN RATNLNARMH DGTTPLILAA RLAIEGMVED 
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^"NLSVQVSEAN ligtgtsehw vdde- 

PLK-NASDGA LMDDNQNE-W GDED- 
PIK-NMTDGS FMDDNQNE-W GDEET- 
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£ PQPKKVKAED EALLSE-EDD 

- LETKKFRFEE PWLPD-LDD 

- LENKRFRFEE QVILPELVDD 



KQVAMQSQGV GQPGAH — W SDDESDMPLP KRQRSDPVSG VGLGNNGGYA SDHTMVSEYE 



DLSDEDEDAE DSSAN 1 1 TDL VYQGASLQAQ TDRTGEMALH LAARYSRADA AKRLLDAGAD 
ETGNSEEE-E DAPA-VISDF IYQGASLHNQ TDRTGETALH LAARYSRSDA AKRLLEASAD 
ETGNSEEE-E DASANMISDF IGQGAQLHNQ TDRTGETALH LAARYARADA AKRLLESSAD 
DTGEDIENNE DSTAQV I SDL LAQGAELNAT MDKTGETSLH LAARFARADA AKRLLDAGAD 



L I NCQADVNA VDDHGKSALH WAAAVNNVEA TLLLLKNGAN RDMQDNKEET PLFLAAREGS 
L I NSHADVNA VDDLGKSALH WAAAVNNVDA AWLLKNGAN KDMQNNREET PLFLAAREGS 
L I NAHADVNA VDEFGKSALH WAAAVNNVDA AAVLLKNSAN KDMQNNKEET SLFLAAREGS 
LITADADINA ADNSGKTALH WAAAVNNTEA VNILLMHHAN RDAQDDKDET PLFLAAREGS 
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GSAGSLSRLH PVPVPADW— MNRMEVNETQ YNEMFGMVLA PAEG-THPG I APQSRPPEGK 
GQCEWLSRLQ SGMVPNQYNP LRGSVAPGPL STQAPSLQHG -MVGPLHSSL AASALSQMMS 
SQCDWLARLQ NGMVQNQYDP IRNGIQQGN- AQQAQALQHG LMTS-LHNGL PATTLSQMMT 
PSLPTSPTHI QAMRHATQQK QFGGSNLNSL LGGANGGGW GGGGGGGGGV GQGPQNSPVS 

APQPQSTCPP AVAGPLPTMY QIP EM ARL-PSVAFP TAMMPQQDGQ VAQT I LPAYH 

PPQPHLGVSS AASGHLGRSF LSGEPSQADV QPLGPSSLAV HT I LPQ-ESP ALPTSLPSSL 
MQQQHHN-SS TTSTHINSPF CSSDISQTDL QQM—SSNNI HSVMPQ-DTQ IFAASLPSNL 
QQQLGGLEFG SAGLDLNG-F CGSPDSFHSG QMNPPS — 1 QSSMSG-SSP STNMLSPSSQ 



SDWSDVTTSP TPGGAGGGQR GPGTHMSEPPHNN MQVYA 

SDWSEGVSSP PT SMQ SQIARIPEAFK 
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GGCACATGCC ATATGCTCAG CCGGGATACC TATGAGTGCA CCTGTCAAGT CGGGTTTACA GGTAAGGAGT GCCAATGGAC GGATGCCTGC 
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TGTGAGACTG ATGTCAATGA GTGTGACATT CCAGGACACT GCCAGCATGG TGGCACCTGC CTCAACCTGC CTGGTTCCTA CCAGTGCCAG 
CET DVNE CDI PGH CQHG GTC LNL PGSY QCQ> 
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********* 

TCCTTCTCTT GCATGTGCCC AGAGGGGAAG GCAGGTCTCC TGTGTCATCT GGATGATGCA TGCATCAGCA ATCCTTGCCA CAAGGGGGCA 
SFS CMCP EGK AGL LCHL D 0 A CIS NPCH KGA> 
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CTGTGTGACA CCAACCCCCT AAATGGGCAA TATATTTGCA CCTGCCCACA AGGCTACAAA GGGGCTGACT GCACAGAAGA TGTGGATGAA 
LCD TNPL NGQ YIC TCPQ GYK GAD CTED VDE> 
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TGTGCCATGG CCAATAGCAA TCCTTGTGAG CATGCAGGAA AATGTGTGAA CACGGATGGC GCCTTCCACT GTGAGTGTCT GAAGGGTTAT 
CAM ANSN PCE HAG KCVN TDG AFH CECL KGY> 
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1540 1550 1560 1570 1580 1590 1600 1610 1620 
** ******* 
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CVD KVNR FQC LCP PGFT GPV CQI DIDD CSS> 
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* * * * * * * * * 

TACAGGTGTA CTTGCAAGAA GGGCTTTAAA GGCTATAACT GCCAGGTGAA TATTGATGAA TGTGCCTCAA ATCCATGCCT GAACCAAGGA 
YRC TCKF GFK GYN CQVN IDE CAS NPCL NQG> 



FIG.17C 



2440 2450 2460 2470 2480 2490 2500 2510 2520 
********* 

ACCTGCTTTG ATGACATAAG TGGCTACACT TGCCACTGTG TGCTGCCATA CACAGGCAAG AATTGTCAGA CAGTATTGGC TCCCTGTTCC 
TCF DDIS GYT CHC VLPY TGK NCQ TVLA PCS> 

2530 2540 2550 2560 2570 2580 2590 2600 2610 
********* 

CCAAACCCTT GTGAGAATGC TGCTGTTTGC AAAGAGTCAC CAAATTTTGA GAGTTATACT TGCTTGTGTG CTCCTGGCTG GCAAGGTCAG 
PNP CENA AVC KES PNFE SYT CLC APGW QGQ> 

2620 2630 2640 2650 2660 2670 2680 2690 2700 
********* 

CGGTGTACCA TTGACATTGA CGAGTGTATC TCCAAGCCCT GCATGAACCA TGGTCTCTGC CATAACACCC AGGGCAGCTA CATGTGTGAA 
RCT IDID ECI SKP CMNH GLC HNT QGSY MCE> 

2710 2720 2730 2740 2750 2760 2770 2780 2790 
********* 

TGTCCACCAG GCTTCAGTGG TATGGACTGT GAGGAGGACA TTGATGACTG CCTTGCCAAT CCTTGCCAGA ATGGAGGTTC CTGTATGGAT 
CPP GFSG MDC EED IDDC LAN PCQ NGGS CMD> 

2800 2810 2820 2830 2840 2850 2860 2870 2880 
********* 

GGAGTGAATA CTTTCTCCTG CCTCTGCCTT CCGGGTTTCA CTGGGGATAA GTGCCAGACA GACATGAATG AGTGTCTGAG TGAACCCTGT 
GVN TFSC LCL PGF TGDK CQT DMN ECLS EPC> 

2890 2900 2910 2920 2930 2940 2950 2960 2970 
********* 

AAGAATGGAG GGACCTGCTC TGACTACGTC AACAGTTACA CTTGCAAGTG CCAGGCAGGA TTTGATGGAG TCCATTGTGA GAACAACATC 
KNG GTCS DYV NSY TCKC QAG FDG VHCE NNI> 

2980 2990 3000 3010 3020 3030 3040 3050 3060 
********* 

AATGAGTGCA CTGAGAGCTC CTGTTTCAAT GGTGGCACAT GTGTTGATGG GATTAACTCC TTCTCTTGCT TGTGCCCTGT GGGTTTCACT 

NEC TESS CFN GGT CVDG INS FSC LCPV GFT> 

3070 3080 3090 3100 3110 3120 3130 3140 3150 
********* 

GGATCCTTCT GCCTCCATGA GATCAATGAA TGCAGCTCTC ATCCATGCCT GAATGAGGGA ACGTGTGTTG ATGGCCTGGG TACCTACCGC 
GSF CLHE I N E CSS HPCL NEC TCV DGLG TYR> 

3160 3170 3180 3190 3200 3210 3220 3230 3240 
* * * * * * * * * 

TGCAGCTGCC CCCTGGGCTA CACTGGGAAA AACTGTCAGA CCCTGGTGAA TCTCTGCAGT CGGTCTCCAT GTAAAAACAA AGGTACTTGT 

CSC PLGY TGK NCQ TLVN LCS RSP CKNK GTO 
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3250 

* 



3260 



3270 

* 



3280 

* 



3290 

* 



3300 

* 



3310 



3320 

* 



3330 

* 



GTTCAGAAAA AAGCAGAGTC CCAGTGCCTA TGTCCATCTG GATGGGCTGG TGCCTATTGT GACGTGCCCA ATGTCTCTTG TGACATAGCA 
VQK KAES QCL CPS GWAG AYC DVP NVSC DIA> 



3340 



3350 



3360 

* 



3370 

* 



3380 



3390 

* 



3400 
* 



3410 



3420 



GCCTCCAGGA GAGGTGTGCT TGTTGAACAC TTGTGCCAGC ACTCAGGTGT CTGCATCAAT GCTGGCAACA CGCATTACTG TCAGTGCCCC 
ASR RGVL VEH ICQ HSGV CIN AGN THYC QCP> 

3430 3440 3450 3460 3470 3480 3490 3500 3510 
********* 

CTGGGCTATA CTGGGAGCTA CTGTGAGGAG CAACTCGATG AGTGTGCGTC CAACCCCTGC CAGCAOGGGG CAACATGCAG TGACTTCATT 
LGY TGSY CEE QLO ECAS NPC QHG ATCS DFI> 



3520 3530 3540 3550 3560 3570 3580 3590 3600 
********* 

GGTGGATACA GATGCGAGTG TGTCCCAGGC TATCAGGGTG TCAACTGTGA GTATGAAGTG GATGAGTGCC AGAATCAGCC CTGCCAGAAT 
GGY RCEC VPG YQG VNCE YEV DEC QNQP CQN> 

3610 3620 3630 3640 3650 3660 3670 3680 3690 
********* 

GGAGGCACCT GTATTGACCT TGTGAACCAT TTCAAGTGCT CTTGCCCACC AGGCACTCGG GGCCTACTCT GTGAAGAGAA CATTGATGAC 
GGT CI0L VNH FKC SCPP GTR GLL CEEN IDD> 

3700 3710 3720 3730 3740 3750 3760 3770 3780 
********* 

TGTGCCCGGG GTCCCCATTG CCTTAATGGT GGTCAGTGCA TGGATAGGAT TGGAGGCTAC AGTTGTCGCT GCTTGCCTGG CTTTGCTGGG 
CAR GPHC LNG GQC MDRI GGY SCR CLPG F A G> 



3790 3800 3810 3820 3830 3840 3850 3860 3870 
********* 

GAGCGTTGTG AGGGAGACAT CAACGAGTGC CTCTCCAACC CCTGCAGCTC TGAGGGCAGC CTGGACTGTA TACAGCTCAC CAATGACTAC 
ERC EGDI NEC LSN PCSS EGS LDC I Q L T NDY> 

3880 3890 3900 3910 3920 3930 3940 3950 3960 
********* 

CTGTGTGTTT GCCGTAGTGC CTTTACTGGC CGGCACTGTG AAACCTTCGT CGATGTGTGT CCCCAGATGC CCTGCCTGAA TGGAGGGACT 
LCV CRSA FTG RHC ETFV DVC PQM PCLN GGT> 

3970 3980 3990 4000 4010 4020 4030 4040 4050 
********* 

TGTGCTGTGG CCAGTAACAT GCCTGATGGT TTCATTTGCC GTTGTCCCCC GGGATTTTCC GGGGCAAGGT GCCAGAGCAG CTGTGGACAA 
CAV ASNM POG FIC RCPP GFS GAR CQSS CGQ> 
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4060 4070 4080 4090 4100 4110 4120 4130 4140 
********* 

GTGAMTGTA GGAAGGGGGA GCAGTGTGTG CACACCGCCT CTGGACCCCG CTGCTTCTGC CCCAGTCCCC GGGACTGCGA GTCAGGCTGT 
VKC RKGE QCV HTA SGPR CFC PSP RDCE SGC> 




4150 4160 4170 4180 4190 4200 4210 4220 4230 
********* 

GCCAGTAGCC CCTGCCAGCA CGGGGGCAGC TGCCACCCTC AGCGCCAGCC TCCTTATTAC TCCTGCCAGT GTGCCCCACC ATTCTCGGGT 
ASS PCQH GGS CHP QRQP PYY SCQ CAPP FSG> 

4240 4250 4260 4270 4280 4290 4300 4310 4320 
********* 

AGCCGCTGTG AACTCTACAC GGCACCCCCC AGCACCCCTC CTGCCACCTG TCTGAGCCAG TATTGTGCCG ACAAAGCTCG GGATGGCGTC 
SRC ELYT APP STP PATC LSQ YCA OKAR OGV> 

4330 4340 4350 4360 4370 4380 4390 4400 4410 
* * * * * * * * * 

TGTGATGAGG CCTGCAACAG CCATGCCTGC CAGTGGGATG GGGGTGACTG TTCTCTCACC ATGGAGAACC CCTGGGCCAA CTGCTCCTCC 
C D E A C N S H A C Q W D G G D C S L T MEN P W A N C S S> 



4420 



4430 



4440 



4450 



4460 



4470 



4480 



4490 



4500 



CCACTTCCCT GCTGGGATTA TATCAACAAC CAGTGTGATG AGCTGTGCAA CACGGTCGAG TGCCTGTTTG ACAACTTTGA ATGCCAGGGG 
PLP CWDY INN QCO ELCN TVE CLF ONFE CQG> 



4510 4520 4530 4540 4550 4560 4570 4580 4590 
********* 

AACAGCAAGA CATGCAAGTA TGACAAATAC TGTGCAGACC ACTTCAAAGA CAACCACTGT AACCAGGGGT GCAACAGTGA GGAGTGTGGT 
SK TCKY DKY CAD HFKD NHC NQG CNSE ECG> 



4600 4610 4620 4630 4640 4650 4660 4670 4680 

* * * * * * * * * 

TGGGATGGGC TGGACTGTGC TGCTGACCAA CCTGAGAACC TGGCAGAAGG TACCCTGGTT ATTGTGGTAT TGATGCCACC TGAACAACTG 
WOG LDCA ADQ PEN LAEG TLV IVV LMPP EQL> 

4690 4700 4710 4720 4730 4740 4750 4760 4770 

* * * » * * * * » 

CTCCAGGATG CTCGCAGCTT CTTGCGGGCA CTGGGTACCC TGCTCCACAC CAACCTGCGC ATTAAGCGGG ACTCCCAGGG GGAACTCATG 
LQD ARSFLRA LGT LLHT NLR IKR DSQG ELM> 



4780 4790 4800 4810 4820 4830 4840 4850 4860 
********* 

GTGTACCCCT ATTATGGTGA GAAGTCAGCT GCTATGAAGA AACAGAGGAT GACACGCAGA TCCCTTCCTG GTGAACAAGA ACAGGAGGTG 
VYP YYGE KSA A II K KQRM TRR SIP GEQE QEV> 
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4870 4880 4890 4900 4910 4920 4930 4940 4950 
********* 

GCTGGCTCTA AAGTCTTTCT GGAAATTGAC AACCGCCAGT GTGTTCAAGA CTCAGACCAC TGCTTCAAGA ACACGGATGC AGCAGCAGCT 
AGS KVFL EID NRQ CVQD SDH CFK NTDA A A A> 

4960 4970 4980 4990 5000 5010 5020 5030 5040 

CTCCTGGCCT CTCACGCCAT ACAGGGGACC CTGTCATACC CTCTTGTGTC TGTCGTCAGT GAATCCCTGA CTCCAGAACG CACTCAGCTC 
LLA SHAI QGT LSY PLVS VVS ESL TPER TQL> 

5050 5060 5070 5080 5090 5100 5110 5120 5130 
********* 

CTCTATCTCC TTGCTGTTGC TGTTGTCATC ATTCTGTTTA TTATTCTGCT GGGGGTAATC ATGGCAAAAC GAAAGCGTAA GCATGGCTCT 
LYL LAVA VVI 1LF IILL GVI MAKRKRK HGS> 

5140 5150 5160 5170 5180 5190 5200 5210 5220 
********* 

CTCTGGCTGC CTGAAGGTTT CACTCTTCGC CGAGATGCAA GCAATCACAA GGGTCGTGAG CCAGTGGGAC AGGATGCTGT GGGGCTGAAA 
LWL PEGF TLR RDA SNHK RRE PVG QDAV GLK> 

5230 5240 5250 5260 5270 5280 5290 5300 5310 
********* 

AATCTCTCAG TGCAAGTCTC AGAAGCTAAC CTAATTGGTA CTGGAACAAG TGAACACTGG GTCGATGATG AAGGGCCCCA GCCAAAGAAA 
NLS VQVS EAN LIG TGTS EHW VDD EGPQ PKK> 

5320 5330 5340 5350 5360 5370 5380 5390 5400 
********* 

GTAAAGGCTG AAGATGAGGC CTTACTCTCA GAAGAAGATG ACCCCATTGA TCGACGGCCA TGGACACAGC AGCACCTTGA AGCTGCAGAC 
V K A EOEA LLS EED DPIO RRP WTQ QHLE AAO> 

5410 5420 5430 5440 5450 5460 5470 5480 5490 
********* 

ATCCGTAGGA CACCATCGCT GGCTCTCACC CCTCCTCAGG CAGAGCAGGA GGTGGATGTG TTAGATGTGA ATGTCCGTGG CCCAGATGGC 
IRR TPSL ALT PPQ AEQE VOV LOV NVRG PDG> 

5500 5510 5520 5530 5540 5550 5560 5570 5580 
********* 

TGCACCCCAT TGATGTTGGC TTCTCTCCGA GGAGGCAGCT CAGATTTGAG TGATGAAGAT GAAGATGCAG AGGACTCTTC TGCTAACATC 
CTP LMLA SLR GGS SDLS DEO EDA EDSS ANI> 

5590 5600 5610 5620 5630 5640 5650 5660 5670 
********* 

ATCACAGACT TGGTCTACCA GGGTGCCAGC CTCCAGGCCC AGACAGACCG GACTGGTGAG ATGGCCCTGC ACCTTGCAGC CCGCTACTCA 
I TO LVYQ GAS LQA QTDR TGE MAL HLAA RYS> 
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5680 5690 5700 5710 5720 5730 5740 5750 5760 

* * * * * * * * * 

CGGGCTGATG CTGCCAAGCG TCTCCTGGAT GCAGGTGCAG ATGCCAATGC CCAGGACAAC ATGGGCCGCT GTCCACTCCA TGCTGCAGTG 
RAD AAKR LLD AGA DANA QDN MGR CPLH A A V> 

5770 5780 5790 5800 5810 5820 5830 5840 5850 
********* 

GCAGCTGATG CCCAAGGTGT CTTCCAGATT CTGATTCGCA ACCGAGTAAC TGATCTAGAT GCCAGGATGA ATGATGGTAC TACACCCCTG 
A A 0 AQGV FQI LIR NRVT DLD ARM NDGT TPL> 

5860 5870 5880 5890 5900 5910 5920 5930 5940 
********* 

ATCCTGGCTG CCCGCCTGGC TGTGGAGGGA ATGGTGGCAG AACTGATCAA CTGCCAAGCG GATGTGAATG CAGTGGATGA CCATGGAAAA 
I L A ARLA VEG II V A EL1N CQA DVN AVDD HGK> 

5950 5960 5970 5980 5990 6000 6010 6020 6030 
********* 

TCTGCTCTTC ACTGGGCAGC TGCTGTCAAT AATGTGGAGG CAACTCTTTT GTTGTTGAAA AATGGGGCCA ACCGAGACAT CCAGGACAAC 
SAL HWAA A V N NVE ATLL LLK N G A NRDM QDN> 

6040 6050 6060 6070 6080 6090 6100 6110 6120 

* * * * * * * * * 

AAGGAAGAGA CACCTCTGTT TCTTGCTGCC CGGGAGGGGA GCTATGAAGC AGCCAAGATC CTGTTAGACC ATTTTGCCAA TCGAGACATC 
KEE TPLF LAA REG SYEA AKI LLD HFAN RD1> 

6130 6140 6150 6160 6170 6180 6190 6200 6210 

* * * * * * * . » $ 

ACAGACCATA TGGATCGTCT TCCCCGGGAT GTGGCTCGGG ATCGCATGCA CCATGACATT GTGCGCCTTC TGGATGAATA CAATGTGACC 
TDH MDRL PRO VAR DRMH H D I VRL LDEY NVT> 

6220 6230 6240 6250 6260 6270 6280 6290 6300 
******** * 

CCAAGCCCTC CAGGCACCGT GTTGACTTCT GCTCTCTCAC CTGTCATCTG TGGGCCCAAC AGATCTTTCC TCAGCCTGAA GCACACCCCA 
PSP PGTV LTS ALS PV1C GPN RSF LSLK HTP> 

6310 6320 6340 6350 6360 6370 6380 6390 6400 
********* 

ATGGGCAAGA AGTCTAGACG GCCCAGTGCC AAGAGTACCA TGCCTACTAG CCTCCCTAAC CTTGCCAAGG AGGCAAAGGA TGCCAAGGGT 
MGK KSRR PSA KST MPTS LPN LAK EAKD A K G> 

6400 6410 6420 6430 6440 6450 6460 6470 6480 
********* 

AGTAGGAGGA AGAAGTCTCT GAGTGAGAAG GTCCAACTGT CTGAGAGTTC AGTAACTTTA TCCCCTGTTG ATTCCCTAGA ATCTCCTCAC 
SRR KKSL SEK VQL SESS VTL SPV DSLE SPH> 




FIG.17H 



6490 6500 6510 6520 6530 6540 6550 6560 6570 
♦ * * * * * * * * 

ACGTATGTTT CCGACACCAC ATCCTCTCCA ATGATTACAT CCCCTGGGAT CTTACAGGCC TCACCCAACC CTATGTTGGC CACTGCCGCC 
TYV SDTT SSP MIT SPGI LQA SPN PMLA T A A> 

6580 6590 6600 6610 6620 6630 6640 6650 6660 
********* 

CCTCCTGCCC CAGTCCATGC CCAGCATGCA CTATCTTTTT CTAACCTTCA TGAAATGCAG CCTTTGGCAC ATGGGGCCAG CACTGTGCTT 
PPA PVHA QHA LSF SNLH EMQ PLA HGAS TVL> 

6670 6680 6690 6700 6710 6720 6730 6740 6750 
********* 

CCCTCAGTGA GCCAGTTGCT ATCCCACCAC CACATTGTGT CTCCAGGCAG TGGCAGTGCT GGAAGCTTGA GTAGGCTCCA TCCAGTCCCA 
PSV SQLL SHH HIV SPGS GSA GSL SRLH PVP> 

6760 6770 6780 6790 6800 6810 6820 6830 6840 
********* 

GTCCCAGCAG ATTGGATGAA CCGCATGGAG GTGAATGAGA CCCAGTACAA TGAGATGTTT GGTATGGTCC TGGCTCCAGC TGAGGGCACC 
VPA DWMN RME VNE TQYN EMF G II V LAPA EGT> 

6850 6860 6870 6880 6890 6900 6910 6920 6930 

********* 

CATCCTGGCA TAGCTCCCCA GAGCAGGCCA CCTGAAGGGA AGCACATAAC CACCCCTCGG GAGCCCTTGC CCCCCATTGT GACTTTCCAG 
HPG IAPQ SRP PEG KHIT TPR EPL PPIV TFQ> 

6940 6950 6960 6970 6980 6990 7000 7010 7020 
********* 

CTCATCCCTA AAGGCAGTAT TGCCCAACCA GCGGGGGCTC CCCAGCCTCA GTCCACCTGC CCTCCAGCTG TTGCGGGCCC CCTGCCCACC 
LIP KGSI AQP AGA PQPQ STC PPA VAGP LPT> 

7030 7040 7050 7060 7070 7080 7090 7100 7110 
********* 

ATGTACCAGA TTCCAGAAAT GGCCCGTTTG CCCAGTGTGG CTTTCCCCAC TGCCATGATG CCCCAGCAGG ACGGGCAGGT AGCTCAGACC 
MYQ IPEM ARL PSV AFPT AMM PQQ DGQV A Q T> 

7120 7130 7140 7150 7160 7170 7180 7190 7200 

********* 

ATTCTCCCAG CCTATCATCC TTTCCCAGCC TCTGTGGGCA AGTACCCCAC ACCCCCTTCA CAGCACAGTT ATGCTTCCTC AAATGCTGCT 
ILP AYHP FPA SVG KYPT PPS QHS YASS N A A> 

7210 7220 7230 7240 7250 7260 7270 7280 7290 
********* 

GAGCGAACAC CCAGTCACAG TGGTCACCTC CAGGGTGAGC ATCCCTACCT GACACCATCC CCAGAGTCTC CTGACCAGTG GTCAAGTTCA 
ERT PSHS GHL QGE HPYL TPS PES PDQW SSS> 
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7300 7310 7320 7330 7340 7350 7360 7370 7380 
* * * * * * * » * 

TCACCCCACT CTGCTTCTGA CTGGTCAGAT GTGACCACCA GCCCTACCCC TGGGGGTGCT GGAGGAGGTC AGCGGGGACC TGGGACACAC 
SPH SASD WSD VTT SPTP GGA GGG QRGP GTH> 




7390 7400 7410 7420 7430 7440 7450 7460 7470 
********* 

ATGTCTGAGC CACCACACAA CAACATGCAG GTTTATGCGT GAGAGAGTCC ACCTCCAGTG TAGAGACATA ACTGACTTTT GTAAATGCTG 
MSE PPHN NMQ VYA> 

7480 7490 7500 7510 7520 7530 7540 7550 7560 
********* 

CTGAGGAACA AATGAAGGTC ATCCGGGAGA GAAATGAAGA AATCTCTGGA GCCAGCTTCT AGAGGTAGGA AAGAGAAGAT GTTCTTATTC 

7570 7580 7590 7600 7610 7620 7630 7640 7650 
********* 

AGATAATGCA AGAGAAGCAA TTCGTCAGTT TCACTGGGTA TCTGCAAGGC TTATTGATTA TTCTAATCTA ATAAGACAAG TTTGTGGAAA 



7660 7670 7680 7690 7700 7710 7720 7730 7740 
********* 

TGCAAGATGA ATACAAGCCT TGGGTCCATG TTTACTCTCT TCTATTTGGA GAATAAGATG GATGCTTATT GAAGCCCAGA CATTCTTGCA 



7750 



7760 
* 



7770 
* 



7780 
* 



7790 
* 



7800 

* 



7810 



7820 



7830 
* 



GCTTGGACTG CATTTTAAGC CCTGCAGGCT TCTGCCATAT CCATGAGAAG ATTCTACACT AGCGTCCTGT TGGGAATTAT GCCCTGGAAT 

7840 7850 7860 7870 7880 7890 7900 7910 7920 
********* 

TCTGCCTGAA TTGACCTACG CATCTCCTCC TCCTTGGACA TTCTTTTGTC TTCATTTGGT GCTTTTGGTT TTGCACCTCT CCGTGATTGT 

7930 7940 7950 7960 7970 7980 7990 8000 8010 
********* 

AGCCCTACCA GCATGTTATA GGGCAAGACC TTTGTGCTTT TGATCATTCT GGCCCATGAA AGCAACTTTG GTCTCCTTTC CCCTCCTGTC 

8020 8030 8040 8050 8060 8070 8080 8090 8100 
********* 

TTCCCGGTAT CCCTTGGAGT CTCACAAGGT TTACTTTGGT ATGGTTCTCA GCACAAACCT TTCAAGTATG TTGTTTCTTT GGAAAATGGA 

8110 8120 8130 8140 8150 8160 8170 8180 8190 
********* 

CATACTGTAT TGTGTTCTCC TGCATATATC ATTCCTGGAG AGAGAAGGGG AGAAGAATAC TTTTCTTCAA CAAATTTTGG GGGCAGGAGA 



8200 8210 8220 8230 8240 8250 8260 8270 8280 
********* 

TCCCTTCAAG AGGCTGCACC TTAATTTTTC TTGTCTGTGT GCAGGTCTTC ATATAAACTT TACCAGGAAG AAGGGTGTGA GTTTGTTGTT 
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8290 8300 8310 8320 8330 8340 8350 8360 8370 
********* 

TTTCTGTGTA TGGGCCTGGT CAGTGTAAAG TTTTATCCTT GATAGTCTAG TTACTATGAC CCTCCCCACT TTTTTAAAAC CAGAAAAAGG 



8380 



8390 



8400 



8410 



8420 



8430 



8440 



8450 



8460 




TTTGGAATGT TGGAATGACC AAGAGACAAG TTAACTCGTG CAAGAGCCAG TTACCCACCC ACAGGTCCCC CTACTTCCTG CCAAGCATTC 

8470 8480 8490 8500 8510 8520 8530 8540 8550 
********* 

CATTGACTGC CTGTATGGAA CACATTTGTC CCAGATCTGA GCATTCTAGG CCTGTTTCAC TCACTCACCC AGCATATGAA ACTAGTCTTA 

8560 8570 8580 8590 8600 8610 8620 8630 8640 
* * * * * * * * * 

ACTGTTGAGC CTTTCCTTTC ATATCCACAG AAGACACTGT CTCAAATGTT GTACCCTTGC CATTTAGGAC TGAACTTTCC TTAGCCCAAG 



8650 
* 



8660 
* 



8670 
* 



8680 
* 



8690 
* 



8700 
* 



8710 

* 



8720 



8730 
* 



GGACCCAGTG ACAGTTGTCT TCCGTTTGTC AGATGATCAG TCTCTACTGA TTATCTTGCT GCTTAAAGGC CTGCTCACCA ATCTTTCTTT 

8740 8750 8760 8770 8780 8790 8800 8810 8820 
********* 

CACACCGTGT GGTCCGTGTT ACTGGTATAC CCAGTATGTT CTCACTGAAG ACATGGACTT TATATGTTCA AGTGCAGGAA TTGGAAAGTT 

8830 8840 8850 8860 8870 8880 8890 8900 8910 
* * * * * * * * * 

GGACTTGTTT TCTATGATCC AAAACAGCCC TATAAGAAGG TTGGAAAAGG AGGAACTATA TAGCAGCCTT TGCTATTTTC TGCTACCATT 

8920 8930 8940 8950 8960 8970 8980 8990 9000 
********* 

TCTTTTCCTC TGAAGCGGCC ATGACATTCC CTTTGGCAAC TAACGTAGAA ACTCAACAGA ACATTTTCCT TTCCTAGAGT CACCTTTTAG 

9010 9020 9030 9040 9050 9060 9070 9080 9090 
********* 

ATGATAATGG ACAACTATAG ACTTGCTCAT TGTTCAGACT GATTGCCCCT CACCTGAATC CACTCTCTGT ATTCATGCTC TTGGCAATTT 

9100 9110 9120 9130 9140 9150 9160 9170 9180 
********* 

CTTTGACTTT CTTTTAAGGG CAGAAGCATT TTAGTTAATT GTAGATAAAG AATAGTTTTC TTCCTCTTCT CCTTGGGCCA GTTAATAATT 



9190 9200 9210 9220 9230 9240 9250 9260 9270 
* * * * * * * * * 

GGTCCATGGC TACACTGCAA CTTCCGTCCA GTGCTGTGAT GCCCATGACA CCTGCAAAAT AAGTTCTGCC TGGGCATTTT GTAGATATTA 
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9280 9290 9300 9310 9320 9330 9340 9350 9360 
********* 

ACAGGTGAAT TCCCGACTCT TTTGGTTTGA ATGACAGTTC TCATTCCTTC TATGGCTGCA AGTATGCATC AGTGCTTCCC ACTTACCTGA 

9370 9380 9390 9400 9410 9420 9430 9440 9450 
* * * * * * * * * 

TTTGTCTGTC GGTGGCCCCA TATGGAAACC CTGCGTGTCT GTTGGCATAA TAGTTTACAA ATGGTTTTTT CAGTCCTATC CAAATTTATT 

9460 9470 9480 9490 9500 9510 9520 9530 9540 
********* 

GAACCAACAA AAATAATTAC TTCTGCCCTG AGATAAGCAG ATTAAGTTTG TTCATTCTCT GCTTTATTCT CTCCATGTGG CAACATTCTG 

9550 9560 9570 9580 9590 9600 9610 9620 9630 
******* ** 

TCAGCCTCTT TCATAGTGTG CAAACATTTT ATCATTCTAA ATGGTGACTC TCTGCCCTTG GACCCATTTA TTATTCACAG ATGGGGAGAA 

9640 9650 9660 9670 9680 9690 9700 9710 9720 
********* 
CCTATCTGCA TGGACCCTCA CCATCCTCTG TGCAGCACAC ACAGTGCAGG GAGCCAGTGG CGATGGCGAT GACTTTCTTC CCCTGGGAAT 



TCC 
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